Rank in Wordlist | Frequency | Word |
---|---|---|
2587 | 771 | 1,5 |
4380 | 466 | 2,5 |
6576 | 302 | 3,5 |
7479 | 263 | 1,2 |
8371 | 233 | 1,3 |
8624 | 226 | 1,8 |
8684 | 224 | 0,5 |
9633 | 200 | 4,5 |
11060 | 171 | 1,7 |
11118 | 170 | 1,6 |
Rank in Wordlist | Frequency | Word |
---|---|---|
224117 | 2 | .) |
Rank in Wordlist | Frequency | Word |
---|---|---|
40298 | 33 | 100% |
53762 | 22 | 70% |
57456 | 20 | 50% |
57465 | 20 | 80% |
66741 | 16 | 5% |
69596 | 15 | 30% |
72939 | 14 | 20% |
76540 | 13 | 10% |
76584 | 13 | 40% |
80742 | 12 | 90% |
Rank in Wordlist | Frequency | Word |
---|---|---|
52184 | 23 | G&G |
81673 | 12 | S&P |
105061 | 8 | H&M |
106101 | 8 | R&B |
157330 | 4 | AT&T |
162381 | 4 | P&G |
167029 | 4 | bit&Byte |
184924 | 3 | BY&N |
189871 | 3 | L&M |
194197 | 3 | S&T |
Rank in Wordlist | Frequency | Word |
---|---|---|
316126 | 1 | A$AP |
Rank in Wordlist | Frequency | Word |
---|---|---|
2556 | 780 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
9378 | 206 | Mike'as |
22296 | 73 | George'o |
31205 | 47 | Charles'is |
33287 | 43 | George'as |
50722 | 24 | Mike'o |
50776 | 24 | Pence'as |
59774 | 19 | Mike'u |
66978 | 16 | Gove'as |
73349 | 14 | Jerome'as |
73482 | 14 | M.Kimmage'as |
Rank in Wordlist | Frequency | Word |
---|---|---|
104286 | 8 | 5+1 |
124021 | 6 | 1+0 |
124103 | 6 | 17+1 |
138053 | 5 | 0+1 |
138271 | 5 | 2+2 |
138371 | 5 | 3+1 |
139386 | 5 | C+Pod |
156516 | 4 | 1+1 |
182497 | 3 | 1+2 |
182498 | 3 | 1+3 |
Rank in Wordlist | Frequency | Word |
---|---|---|
5659 | 357 | m/s |
8357 | 234 | km/val |
10043 | 191 | https://www |
12370 | 151 | km/h |
23038 | 70 | Eur/MWh |
25692 | 61 | km/val. |
30319 | 49 | ct/kWh |
31410 | 47 | kg/ha |
35307 | 40 | ir/ar |
40366 | 33 | Eur/mėn |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots